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Abstract 

The Markov commutator associated to a finite Markov kernel P is the convex semigroup con¬ 
sisting of all Markov kernels commuting with P. Its interest comes from its relation with the 
hypergroup property and with the notion of Markovian duality by intertwining. In particular, it 
is shown that the discrete analogue of the Achour-Trimeche’s theorem, asserting the preservation 
of non-negativity by the wave equations associated to certain Metropolis birth and death transi¬ 
tion kernels, cannot be extended to all convex potentials. But it remains true for symmetric and 
monotone potentials which are sufficiently convex. 
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1 Introduction 


The primary motivation for this paper is to disprove, at least in a finite context, a conjecture due 
to Dominique Bakry, about an extension of Achour-Trimeche’s theorem pj (see also Bakry and 
Huet 0). It also provides the opportunity to begin a systematic study of the commutator convex 
semi-group associated to a Markov kernel. 

Here we will only be concerned with state spaces V which are finite and endowed with a Markov 
kernel P, namely a matrix (P(x,y)) Xtye v whose entries are non-negative and whose row sums are 
equal to 1. Two classical assumptions on P are: 

Irreducibility: all the coefficients of Xme[|v|] -R” are positive (|V| is the cardinality of V and we 
denote for any fc < Z e Z, pc, Z] := {k, k + 1,Z — 1, Z}, and [Ze] := [1, Ze] for k e N). 

Reversibility: there exists a probability measure // positive on V. such that 

\/x,yeV, n(x)P(x,y) = p(y)P{y,x) (1) 

Under the reversibility assumption, there exist orthonormal bases of L 2 (/r) consisting of eigen¬ 
vectors (pi, (pi,(p\y\ of P , associated to the eigenvalues 1 = 6 \ ^ 62 > ■ ■ ■ 5 s 0|vj ^ —1. Without 
loss of generality, we will always choose <p\ = 1. We say that P satisfies the hypergroup prop¬ 
erty with respect to a point xo e V, if the previous basis can be chosen such that pk(xo) =j= 0 for 
all k e l\V\}, and 


V x, y, z e V, 


E 

fcelTI] 


Pk(x)pk(y)pk(z) 

<Pk(x 0 ) 


^ 0 


( 2 ) 


These notions can be immediately extended to Markov generators L on V, namely matrices 
whose off-diagonal entries are non-negative and whose row sums vanish (for instance by considering 
the generated semi-group (Pt)t ^0 := (exp(ZL))^o and by asking that the above conditions are 
satisfied by Pt, for some t > 0, it does not depend on the choice of t > 0). Extensions to 
more general Markov processes are also possible, but they may require some care. E.g. in [3], 
Bakry and Huet consider one-dimensional diffusion generators of the form Ljj d 2 — U’d on 

[—1,1], with Neumann conditions on the boundary and where Z7 : [—1,1] 1 is a smooth 

potential. They prove Achour-Trimeche’s theorem [lj, asserting that if U is convex and either 
monotonous or symmetric with respect to 0, then Lu satisfies the hypergroup property. In a 
personal communication, Dominique Bakry was wondering if this result would remain true if the 
assumption “monotonous or symmetric with respect to 0” was removed. Our main objective is to 
show that this is wrong, at least in the finite setting. 

More precisely, let N e N\{1} be given and denote by C the set of functions U : [0, N} —> M 
which are convex (i.e. whose natural piecewise affine extension to [0,1V] is convex). For U e C, let 
Hu be the probability on [0, A r ] given by 

V x e|0,iVj, h u(x) := Z~' exp(-U(x)) (3) 


where Zu is the renormalizing constant. For any U e C, assume we are given an irreducible birth 
and death Markov transition Pjj on |0, A r J whose invariant probability is pu- Recall that a birth 
and death Markov transition P on [0, NJ is a Markov kernel such that 


V x,y e [0, IV], P(x,y)> 0 => \x - y\ ^ 1 

An invariant measure of such a kernel necessarily satisfies CD, so that an irreducible birth and 
death Markov matrix is reversible. 

Endowing C and the set of Markov kernels from the topology inherited respectively from 
and . we sa y that the above mapping C 3 U >—> Pu is a (birth and death) generalized 
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Metropolis procedure if it is continuous. A classical Metropolis procedure corresponds for 
instance to the Markov kernel Mjj defined by 


Vx + ye[0,A], M v (x,y) 


Mq(x, y) fU(x)-U(y)\ 

^r ew { — 2 — ) 


where the exploration Markov kernel Mq is given by 


v X 4= y e [[0, Nj, M 0 (x,y ) 


1/2 , if \x - y\ = 1 
0 , otherwise 


and where 




max 

xelO,N} 


S 

2/e [0,JV]\{a:} 


Mo{x,y) exp 


( U(x)-U(y) ^ 


(4) 

(5) 

( 6 ) 


As usual, the diagonal entries of the matrices Mu and Mq are imposed by the condition that the 
row sums are equal to 1. 

Our main result is: 


Theorem 1 It does not exist a generalized Metropolis procedure C 3 U i— > Pu such that Pu satisfies 
the hypergroup property for all U e C. 

In p3], we checked numerically (by appropriate random choices of U in C) that a variant of 
the classical Metropolis procedure (described as C 3 U >—> Mu with the notation introduced in (1331) 
below) does not satisfy the hypergroup property. 

The proof of Theorem[I]is based on properties of the commutator convex semi-group IC(P) 
associated to a Markov kernel P on V : it is the set of Markov kernels K on V commuting with 
P: KP = PK. It is immediate to see that it is convex and that it is a semi-group: if K and 
K' belong to JC(P), the same is true for their product KK'. It was introduced in [13], because it 
gives a simple Markovian characterization of the hypergroup property for certain kernels. More 
precisely, let us introduce the following objects: 

VseV, /C(P, x) := {K(x, •) : K e JC(P)} cr V(V) 
where V(V) is the convex set of probability measures on V, and 

H{P) = {xe V : JC(P,x) = V(V)} 


Furthermore, say that a Markov kernel is uniplicit if it is reversible and if all its eigenvalues 
are of multiplicity 1 (in particular the eigenvalue 1 is of multiplicity 1, so that uniplicity implies 
irreducibility). The interest of these notions is: 

Lemma 2 An uniplicit Markov kernel P on V satisfies the hypergroup property with respect to 
xq e V if and only if xq e H(P). 


Let us give succinctly some underlying arguments, since this is the only place in the paper where 
Definition ([2]) will play a role. 

Proof 

The reverse implication was observed in [T3j and the direct implication is a consequence of the 
considerations of Bakry and Huet [3], the uniplicit assumption is not even needed, as the following 
reminder show. Let P be a reversible Markov kernel P on V with an associated orthonormal basis 
of eigenvectors ipi, V\v\ as above. Assume that P satisfies the hypergroup property with 

respect to xq £ V. Let iebbe given and consider the kernel K x given by 


Vy,zeV, 


K x (y,z ) := 

fcs[|V|] 


< Pk(x)(p k (y)<Pk(z ) 

Vk(x o) 


y(z) 
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By assumption it is non-negative and for any fixed y £ V, we have by orthonormality, 

X K x{y-, z ) = 

zeV 


Thus K x is a Markov kernel. A similar computation shows that for any k £ [2, |V|], pk is also an 
eigenfunction of K x associated to the eigenvalue ipk(x)/<Pk(x o)- It follows that K x shares with P 
the same basis of eigenvectors, so that K x e IC(P). Furthermore, we have that for any l e [|V|]], 

K x [ipi\(x o) := YjK x {xq ,z)<pi(z) 

zeV 

= s s <Pk(x)<Pk(z)<Pl(z) m(z) 

zeV fc6[|V|l 

= <Pi(x) 

It implies that K x (x o, ■) = S x . So for any x e V, 5 X £ K.(P,xq). Taking into account that JC(P,x o) 
is always a convex set, we get that xq £ H(P). 


X K x(y,z)<Pi(z) 


zeV 


2 

ksmi ^(*o) ^ 

^i(x)y?i(?/) 


Remark 3 (a) The uniplicity assumption cannot be removed for the reverse implication of 

Lemma[2j Consider P the transition kernel of the random walk on V Z/(nZ), with n £ N\{1, 2}. 
At the end of Section 2.5 from [3], Bakry and Huet show that P does not satisfy the hypergroup 
property. Nevertheless, consider for v £ Z/(nZ), the translation by v kernel K defined by 

V x,y £ Z/(nZ), K(x,y) := S x+V (y) 


Clearly K £ JC(P) and K( 0, ■) = S v , so that 5 V £ /C(P, 0) for all v £ Z/(nZ). It follows that 
0 £ 'H(P). More precisely, we have 'H(P) = Z/(nZ). 

(b) The example in (a) satisfies the complex hypergroup property with respect to any 
point xq £ Z/(nZ) (see Proposition 2.10 of Bakry and Huet [3]), in the sense that we can find an 
unitary basis (tp\ , y> 2 , ■■■, P\v\) °f Ib 2 (li, C) consisting of eigenvectors of P such that tpk(x o) =|= 0 for 
all k £ I|P|], and 


V x, y, z £ V, 


So maybe the condition 


y Pk(x)Pk(y)Pk(z) 

U{P) 4= 0 


(7) 

( 8 ) 


is related to the complex hypergroup property. But here we will not investigate this question. We 
will mainly be interested in (J8|). seen as a generalization of the hypergroup property, because it 
could be considered for Markov kernels which are not reversible (or defined on abstract measurable 
spaces: ((HI) enables to avoid the technical difficulties related to the summations appearing in ((2j) 
or © when the state space is not finite). 

□ 

An irreducible birth and death kernel is necessarily uniplicit, so in the context of Theorem [TJ the 
hypergroup property for a Markov kernel P is equivalent to (JBD- We are thus lead to investigate 
the corresponding Markov commutator convex semi-group and will do it using general arguments. 
The two properties we will need are 
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Proposition 4 Assume that P is an irreducible Markov kernel and let p be its invariant proba¬ 
bility. Then we have 


\/xe'H(P), p(x) = min/i 

For the second property, we need to introduce the symmetry group Sp associated to P: it is 
the set of bijective mappings g : V —* V such that 

\fx,yeV, P{g(x),g(y)) = P(x,y) (9) 

For instance, one recovers the permutation group Sy of V if P is either the identity matrix I (no 
move is permitted) or the matrix whose all off-diagonal entries are equal to 1/(|P| — 1) (all “true” 
moves are equally permitted). Indeed Sp = Sy if and only if P is a convex combination of the two 
previous matrices, situations where all the elements of V are indistinguishable with respect to the 
evolution dictated by P. 

Proposition 5 Assume that P is an uniplicit Markov kernel and let xq,x\ e 'H(P). Then there 
exists g e Sp such that g(x i) = xq. Conversely, any g e S stabilizes TL{P), so that TL{P) is the 
orbit of any of its element under Sp. 

Another natural question in the finite birth and death setting is the transposition of the Achour- 
Trimeche’s theorem known in the continuous framework. We did not succeed in getting a really 
satisfactory answer in this direction. The next result is obtained by adapting the arguments of 
Bakry and Huet [3]. Let C be the subset of U e C such that U(x + 2) — U(x + 1) > U(x + 1) — 
U(x) + 21n(2) for all x e [0, N — 2]] (equivalently, U is the restriction to [0, NJ of a C 2 function on 
[0, N] satisfying U" > 21n(2)). Let C m be the subset of C consisting of monotonous mappings such 
that \U(N) — U(N — 1)[ a 117(1) — I7(0)| A 21n(2). Consider also C s the subset of C consisting of 
mappings symmetric with respect to N/ 2. 

Proposition 6 For any U e C m u C s , the Metropolis kernel Mp defined in © satisfies the hyper¬ 
group property. Thus the mapping C m u C s 3 U >—> Mp is a birth and death Metropolis procedure 
satisfying the hypergroup property. 

In the one-dimensional diffusive setting, the result corresponding to C m is due to Chebli [5]. 

Note that from Propositions [4] and [5] we deduce that in the symmetric situation, TL(Mp) = 
{0, TV}, and that in the monotonous case with U non-constant, TL(Mp) is the singleton consisting 
of the boundary element with the smallest weight with respect to the reversible measure pp. 

Remark [32] (d) gives another example of a generalized Metropolis procedure satisfying the 
hypergroup property for some convex potentials (more general than those considered in Proposi¬ 
tion [6]) . It would be very interesting to find other closed subsets C' a C for which we can find a 
generalized Metropolis procedure C' 3 U Pp satisfying the hypergroup property (or to describe 
C := {U s C : 'H(Mp) =|=0}). Especially to try to deduce the analogous results in the continuous 
framework, in order to recover Gasper’s example mm, see also Bakry and Huet (3) and Carlen, 
Geronimo and Loss [3]. 

From general considerations related to the Markov commutator convex semi-groups, we will 
also deduce the following criterion. Let P be a Markov kernel on the finite set V, consider G a 
subgroup of Sp and denote by = the equivalence relation it induces on V via 

V x,y eV, x = y <=> 3 g e G : g(x) = y 

Denote by V the set of equivalence classes for = and by it : V —> V the associated projection 
mapping. It is immediate to check that a Markov kernel P is well-defined on V through the formula 

\/ x,y eV, P(x,y) := P(x, 7r _1 (y)) 
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where x is any point of V such that tt(x) = x. This construction corresponds to a reduction of 
the symmetries of P. The next result shows that some properties of P are preserved under this 
operation. It will be used to check the hypergroup property of My for U e C m , knowing it for 
UeC s . 

Proposition 7 Assume that P is uniplicit and satisfies Condition ([8]). Then the same remains 
true for P. 

If the uniplicity of P could be removed from this statement and be replaced by the uniplicity of 
P (this is a weaker condition, since it will be seen in the proof of Corollary 1231 that the uniplicity of 
P implies that of P under the assumptions of Proposition [7|), this result would provide an abstract 
rewriting in the finite context of the Carlen, Geronimo and Loss method [4j. This conjectured 
extension seems quite challenging, some assumptions could be required on the subgroup G. Maybe 
they do not appear here, because when P is uniplicit, Sp is commutative, see Remark 1201 (a) below. 

In the next section we will study the Markov commutator convex semi-group in the general finite 
framework, obtaining in particular Propositions 01 [5| and [71 Advantage will be taken of the relations 
between the Markov commutator convex semi-group and the theory of Markov intertwining as it 
was developed by Diaconis and Fill [6j. In the last section we consider more specifically the birth 
and death case and prove Theorem |T] and Proposition [6j 

2 General properties 

This is the beginning of a systematic investigation of the Markov commutator convex semigroup 
/C(P) associated to a finite Markov kernel P. 

We start by recalling some elements of the theory of Markov intertwining due to Diaconis and 
Fill j6]. Let X := (. X n ) ne z+ and X := (. X n ) ne z + be two Markov chains, respectively on the finite 
state spaces V and V. The respective transition kernels are denoted P and P, and the initial 
distributions mo and mo- We say that X is intertwined with X through the Markov link A 
(which is a Markov kernel from V to V, seen asakxb matrix), if there is a coupling (A, X) such 
that the two following conditions are met: 

VneZ+, £(A[ 0jn j|A) = £(Aj 0) „j |X| 0)Tl j) (10) 

where as usual this identity of conditional laws has to be understood a.s. with respect to the 
probability measure underlying the coupling. The trajectorial notation X| 0)Tl ] := (A p ) pe j Q ri ] was 
used. 


VneZ + , C(X n |A| 0n ]) - A(X n ,-) (11) 

When these assumptions are satisfied, we write X <\ X and X is also said to be a dual chain of 
X through A. The notation X < X will notify there exists A such that X <\ X. 

We say that ( mo,P ) is intertwined with ( rho,P ) through the Markov link A if 

mo = moA and PA = A P (12) 

We denote this relation by (mo, P) <a (mo, P) and as above, (mo, P) < (mo, P) means there exists 
a kernel A such that m is satisfied. 

Diaconis and Fill [6] have shown that these notions of intertwining coincide, at least if X visits 
the whole state space V (in particular if P is irreducible): 

Proposition 8 With the above notations, we have 

(m 0 ,P) < A (m 0 ,P) => X <a X 
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Furthermore if for any x e V, there exists n e Z + such that P[X n = x\ > 0, then 

X<\X => (mo,P) <a (fho,P) 


Proof: 

More specifically, the construction of the coupling of X and X satisfying the conditions © 
and (1111) under the assumption (mo,P) <\ (mo,P) is described in Theorem 2.17 of Diaconis and 
Fill [Sj. The other implication can also be deduced from their considerations. For the sake of 
completeness, here are some arguments, directly based on the hypotheses (flOl) and CD- 

From CD. we deduce that for all n e Z + , C(X n \X n ) = A(X n ,-) so that by integration with 
respect to X n . we get C(X n ) = C(X n )A. In particular for n = 0, we obtain vtiq = moA. 

Let / and / two test functions defined respectively on V and V. For fixed n e Z+, we compute 
K[f (X n ) f (X n+ i)] in two ways. First, using (fTTT) and the Markov property of X, 

E[f(X n )f(X n+1 )] = E[f(X n )E[f(X n+1 )\X l0}n+n ]] 

= E[f(X n )A[f](X n+1 )\ 

= E[f(X n )(PA)[f](X n )] 

Second, using (flOl) and the Markov property of X, 

E[f(X n )f(X n+1 )] = E[E[f(X n )\X]f(X n+1 )] 

= E[E[f(X n )\X l0M ]f(X n+1 )] 

= E[E[f(X n )\X M ]P[f](X n )] 

= E[f(X n )P[f\(X n )\ 

= E[f(X n )E[P[f](X n )\X l0M ]] 

= E[f(X n )(AP)[f](X n )] 

Since this is true for any /, we deduce that a.s., 

(A P)[f](X n ) = (PA)[f](X n ) 

and due to the assumption on X, 

(A P)[f](x) = (PA) [/](*) 

Since it is true for all /, it follows that A P = PA. 


Remark 9 (a) The relation < is clearly reflexive (through the identity link) and it can be easily 
checked to be transitive (for instance at the level of the Markov chains, if X <\ X' and X' X" 
then X <aa' X"). Thus < is a pre-order, e.g. on the trajectorial laws of finite Markov chains (whose 
state space is a subset of N, to work on a defined set). It is then tempting to verify if it would 
not be an equivalence or an order relation. To see that < is none, consider Y the trivial Markov 
chain on a singleton. For any finite Markov chain X, we have Y < X, but X < Y is equivalent to 
the stationarity of X (namely the initial distribution of X is invariant for its transition kernel). It 
follows that < is neither symmetrical nor anti-symmetrical. Next, one can define an equivalence 
relation X ~ X' via X < X' and X' < X. On the corresponding equivalence classes, < defines a 
partial order relation, in some sense it should compare the difficulty of reaching an equilibrium (see 
also Remark 111 I below b The “stationarity” class of the trivial chain Y is minimal for this order. 

(b) Similar conditions are valid for the algebraic intertwining between couples consisting of a 
probability measure and a Markov kernel. If the finite state set V and the Markov kernel P are 
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fixed, we induce a relation on P(V) via m .q < rho if and only if ( mo,P ) < ( fho,P ). It can be 
transformed into an order relation on V(V)/ ~ by introducing an equivalence relation ~ as above. 
It heuristically corresponds to the proximity to the set of invariant measures for P, which are the 
minimal elements. Note that the semigroup ( P n ) n eZ + is non-increasing with respect to <, since 
we have (moP,P) <p (mo,P). 

a 

The main interest of associating a dual chain X to a given Markov chain X is that it enables to 
construct strong times (see for instance Diaconis and Fill [6], Fill [10] , Diaconis and Miclo [7] and 
m ). A stopping time r for X (with respect to a filtration containing the filtration generated by 
X) is a strong time if it is a.s. finite and if t and X T are independent. The basic principle of the 
construction is the following well-known result, whose proof is given for the sake of completeness. 

Lemma 10 Let (X,X) be a coupling satisfying (fTTT) . then this equality can be extended to any a.s. 
finite stopping time r for X, namely 

C(X r \X l0A ) = A(Xr, •) 

If in addition (X,X) satisfies (fTOlh then t is a strong time if A(X T ,-) is independent from r (for 
instance if \(X T , •) “is not really depending on” X T , e.g. if X T is a.s. equal to a fixed point). 


Proof 

The first assertion is an outcome of the notion of a stopping time: Let / be a function defined 
on V and F a bounded functional measurable with respect to the stopped trajectory Ap, T ]• We 
compute that 


nnxr)F] 


2 E[f(X n )Ft r=n ] 

nGZ-|_ 

2 E[E[f(X n )\X l0M ]Ft T=n ] 

neZ+ 

2 E[A[/] (X n )Ft T=n ] 

TIE Z+ 

E[A[/] (X t )F] 


where the second equality comes from the fact that Fl T=n is measurable with respect to X| 0)Tl j. 
The first wanted result follows, since this is true for all / and F as above. 

For the second assertion, note that (fTQl) implies that a stopping time for X is also a stopping 
time for X. Let / be a function defined on V and let g be a bounded measurable mapping on R + . 
Since r is measurable with respect to A”[o,t] , we have 


E [f{X T )g{r)] 


E[E[/(X t )|A [0)T jMt)] 
E[A [f](X T )g(r)\ 
E[A[f](X T )]E[g(T)] 
E[f(X T )]E[g(r)] 


where the third equality comes from the assumption made on A(X r , •). The independence of r and 
X T follows, since / and g were arbitrary. 


For the purpose of proving Proposition 0J we will only use the first part of the above lemma, 
even if the stopping times we will consider are indeed strong times. 

Indeed, it is time to come back to the Markov commutator convex semigroup fC(P) associated 
to an irreducible finite Markov kernel P. Denote X m ° := (Xj r>0 )t^o a Markov chain with P as 
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transition kernel and mo £ V(V) for initial law. From the definitions and Proposition [HI we have 
for any K e /C(P) and any initial distribution mo, X m ° K <k X m °. Thus it appears that xq £ 'H(P) 
if and only if for any mo £ V(V), there exists a Markov kernel K on V such that X m ° <x X x ° 
(as customary, X x ° is a shorthand for X Sx o ). In particular, if P is uniplicit, then P satisfies the 
hypergroup property if and only there exists xq e V such that for any mo £ V(V), X m ° < X x °. 
More generally, we get the following interpretation: 

V x e V, fC(P, x) = {m £ V(V) : X m < X x } 

All preliminaries are now in place for the 

Proof of Proposition [4] 

Consider xq £ 'H(P) and let x\ be any point of V. We want to show that n(xo) ^ n{x{). 

By definition of 'H(P), there exists K e /C(P) such that K(x o,-) = S X1 , so that from Proposition 
[HI X Xl <k X x °, i.e. we can construct a coupling of X x ° and X X1 satisfying (flOl) and (fTTT) with 
A := K. 

Let (r n ) ne z + be the sequence of stopping times for X x ° defined by iteration through To = 0 and 

V n £ Z+, r n+ 1 := inf{p > r n : X p = x 0 } 

According to Lemma ITOl for any n £ Z + , 

£ «l x foLi> - 

It means that each time X x ° is in xo, then X X1 is in x\. It remains to apply the ergodic theorem 
to get 

H(x 0 ) = 


where the (in)equalities are valid a.s. 

■ 

The elements of 'H(P) satisfies other optimization properties, they are for instance points from 
which it is the most difficult to reach equilibrium in the separation discrepancy sense: 

Remark 11 Recall that the separation discrepancy s(m,/i) between two probability measures 
on V is defined by 

/ x . ! m(x) 

:= supl- 

xeV t- L \ x ) 

(with the usual convention: r/0 = +co for any r > 0, but 0/0 = 0). 

A stationary time r for an irreducible Markov chain X m ° := (X™°) ne z + (mo still stands for 
the initial distribution) is a strong time such that A™ 0 is distributed according to the associated 
invariant measure fi. Aldous and Diaconis [2] have shown that if the transition kernel is aperiodic 
and irreducible, then for any initial distribution m q, there exists a stationary time r m ° associated 
to X m ° satisfying 


1 


lim 

n—>oo n + 

1 


Y 2 1*0 (** 0 ) 


pe[0,n] 


lim 

»oo n + 

Kx i) 


T 2 


pe[0,n]| 


V n £ Z+, P[r mo > n] = s(m 0 P n ,n) 

Furthermore r m ° is stochastically smaller than any stationary time associated to X m °. 


9 





The proof of Proposition [3] can be slightly modified to show that if xq e 'H(P), then t x ° is 
stochastically larger than r m ° for any initial distribution mo- Indeed, if K e /C(P) is such that 
K(x o, •) = mo, then considering a coupling of X x ° and X m ° realizing the relation X m ° <x X x ° , it 
appears that t x is a stationary time for X m °. It is a consequence of the fact that all the elements 
of /C(P) admit g for invariant measure, as it was seen in [T3] (only the irreducibility of P is needed 
for this property). The stochastic domination of r m ° by t x ° ensures that for any initial distribution 
mo (or equivalently for any Dirac mass m q = 5 X1 , with x\ in the state space V), 

V n e Z + , s(m 0 P n ,g) sc s(P n (x 0 , •), h) 


□ 


To go in the direction of Proposition [5j we begin by a simple technical result: 

Lemma 12 Let K and K' be two Markov kernels on V such that K'K = I, the identity kernel. 
Then there exist g e Sy such that 


V x, y e V, 


K(x,y) = 5 g{x) (y) 
K'(x,y) = S g -i {x) (y) 


Proof 

By contradiction, assume there exists x e V such that K(x,-) is not a Dirac mass. Then for 
any y e V, if K'(y,x) > 0 then K’K(y, •) cannot be a Dirac mass. This is not compatible with 
K’K = /, so we must have K'(y,x) = 0 for all y 6 V. It implies that K' is not invertible, in 
contradiction again with our assumption. So for any x e V, K(x, •) is a Dirac mass 8 g ( x ) f° r some 
g{x) e V. Since K is invertible, necessarily the mapping g is also invertible. The announced result 
follows at once. 


In addition, we will need the following consequence of the uniplicit assumption. 
Lemma 13 Assume that P is uniplicit, then for any fixed xq e TL(P), the affine mapping 

X (P) 3 K !->■ K(x 0 , •) e /C(P, x 0 ) 


is one-to-one. 

Proof 

Fix xq e TL{P) and mo £ V(V), it is sufficient to see there is exactly one matrix K solution to the 
equations 


K(xo, •) = m 0 

KP = PI< 

Indeed, consider g the reversible probability for P and let <£> 2 , •••, T\v\ be an orthonormal (in 
L2(/x)) basis of eigenvectors associated to P as in the introduction. By the commutation of K with 
P, this is also a basis of eigenvectors for K. Thus we can find numbers a±, 02 ,..., a\y\ such that 


v X, y e V, 

K(x,y) 

2 anpi(x)<pi{y)g(y) 
te[i,|V|] 

The first condition then reads 

Vye V, 

m 0 . . 

— (y) 

T 

X! a ivi( x o)vi(y) 
HL\v\l 
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namely (a^;( :c o))ie[i,|y|l are the coefficients of mo//x in the basis <P 2 ,—, P\v\)- Since <pi{xq) 4= 0 
for all l £ [1, |V|], according to Lemma [H we get that the ai,a 2 , ...,aiyi are uniquely determined. 

■ 

In particular if P is an uniplicit kernel satisfying the hypergroup property, then /C(P) is a simplex. 
It is sometimes possible to go further: 

Remark 14 In fact the above proof shows that if xq e V is any point such that <pi(x o) 4= 0 for all 
l £ [1, |V|], then the conclusion of Lemma fl3l still holds if P is uniplicit. If furthermore (jSJ) holds, 
then K{P) is a simplex as well as each of the JC(P, x ), for x £ V. 

Let TZ be the set of Markov kernels which are irreducible and reversible. It can be easily seen 
that the subset of elements of 1Z which are uniplicit and whose eigenvectors never vanish is a dense 
open subset of 7Z. But since TL , the subset of TZ consisting of kernels satisfying the hypergroup 
property, is very slim in TZ, it is no longer clear whether or not the subset of elements of TL which 
are uniplicit and whose eigenvectors never vanish is a dense open subset of TL. If it was true, it 
could be concluded that “generically”, K.{P,x) is a simplex for P e TL and x £ V . 

□ 

We have all the ingredients for the 

Proof of Proposition [5] 

Let be given xq,x\ £ TL(P). Then there exist K',K e }C(P ) such that 

K\xq, •) = S x i 

K(x i,-) = S x o (13) 

Thus we get that K'K (xo > 0 = <5xo- Since P is assumed to be uniplicit, we get from Lemma fl3l that, 
K’K e JC(P) is uniquely determined by this relation. It appears there is no alternative: K’K = 1. 
Lemma [12] enables to find a permutation g £ Sy such K is the Markov kernel induced by g. Note 
that (1131) translates into g(x i) = xq. The commutation of K and P then implies that 

Vx,y£V, P{g(x),y) = P(x,£f _1 (y)) 

which can be rewritten under the form ([9]) namely g £ Sp. The remaining assertions of Proposition[5] 
are straightforward. 


We are now going in the direction of Proposition [7] through a sequence of general arguments, 
in the hope they present in a clear way the problems one will encounter in trying to generalize 
it. We start by recalling some considerations from M- A Markov kernel A from V to V can be 
interpreted as an operator sending any function / defined on V to the mapping A[/] defined on V 
by 

A [f](x) := A (x,x)f{x) 

xeV 

Let p, be a probability measure given on V and consider g := pA its image by A. Then A can 
be seen as an operator from L 2 (/r) to !L 2 ( / u) (because A[/] is /1-negligible if / is //-negligible). It 
enables to define A* its dual operator from L 2 (/Z) to L 2 (//), which is Markovian in the sense that 

A* [Ip] = l v 

V/eL 2 (/r), / 5= 0 => A*[/] > 0 

where the relations have to be understood p- or g-a.s. 

If p and g give positive weights to all points of V and V respectively, then A* can be seen as 
a Markov kernel from V to V. 
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Remark 15 In the intertwining framework, similar considerations are valid for P and P, in order 
to define P* and P*, seen as Markov operators on L 2 (/x) and L 2 (/x), when fl and /x are invariant 
probability measures, respectively for P and P, i.e. fiP = fl and pP = p. Thus to be able to 
consider P* and P* as Markov matrices, it is convenient to make the following assumption: we 
say that the couple (P, A) is positive, if P admits a positive invariant measure fl and if p := flA is 
also positive. Up to reducing V and V respectively to the support of ft and p, it is always possible 
to come back to this case. Note that the commutation relation 

PA = A P (14) 

implies that p is an invariant probability for P. 

Under the hypotheses that (P, A) is positive and that (fHl) is satisfied, we get a dual commu¬ 
tation relation: 


P* A* = A *P* 

If furthermore we assume that (rriQ,P) <a ( rho,P) and that 

777-0 AA* = 7770 (15) 


then we get the intertwining relation 

(m 0 ,P*) <a* (mo.P*) 

The reversibility assumption for P with respect to fl amounts to P* = P and similarly for P. These 
considerations lead to a restricted symmetry property for the relation <: (mo, P) <\ (rho, P) 
implies (rho,P) <\* (?77o,P) under the assumptions that (P, A) is positive, that P and P are 
reversible and that (j!5p is satisfied. This is an instance of the equivalence relation ~ introduced 
in Remark [9j 

We give below in Remark (22] (b) a natural condition under which (1151) is true. 

□ 

Beyond reversibility or uniplicity, an important assumption will be 

AA*PA = PA (16) 

(this condition for the Markov kernel P is an analogue of (1151) for the probability measure 777-0)■ 
Define 


P := A* PA (17) 

From (I16p . it appears that P and P are intertwined through A, namely (1141) is satisfied. We can go 
further in the exploration of Kf(P) with the help of IC(P): the next result is a slight modification 
of Proposition 3 of (11], where IC(P ) was replaced by the smaller set 

K(P, A) := {K e K(P) : AA*KA = KA} 

namely the set of elements from /C(P) satisfying the condition (1161) . It is also a convex semigroup 
and in Lemma fl9l some conditions will be given so that JC(P,A) = JC(P). 

Lemma 16 Assume that P is reversible with respect to fl and that (1161) holds, then we have 

A*1C(P)A cz K(P) 
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Proof 

For any K £ /C(P), we compute that 


A*KAP 


A*KPA 
A* PK A 
A*PAA*KA 
PA*KA 


(18) 


where for the third equality, we have used the dual relation of (1161) asserting that A*P*AA* = 
A*P *, namely A*PAA* = A *P, since P = P*. Relation (fl8l) shows that A*KA belongs to JC(P). 


Condition (1161) seems quite strange at first view and we would have liked to only work with 
(|14l) . Lemma I2T1 below will show this is possible when A is deterministic. 

The motivation for Proposition 3 of pH] was to give an abstract version in the finite context of 
a method of Carlen, Geronimo and Loss [4] to recover the hypergroup property in the context of 
Jacobi polynomials, result initially due to Gasper PHH2). The underlying idea is equally conveyed 
by Lemma [16] to prove (]8]h one tries to find a Markov model (or several ones) P. above P in the 
sense of intertwining (namely according to the order relation induced by < as in Remark [9]), such 
that JC(P) is relatively easy to apprehend. If it appears that /C(P) is quite big, then the inclusion 
of Lemma m gives an opportunity to show that IC(P) is also big, leading us toward (0). But to 
guess such a nice Markov kernel P from P may not be an easy task! That is why we now go in 
the reverse direction, starting with P. In particular it is natural to wonder when does 

U{P) 4 = 0 ( 19 ) 

imply ([8]). Before partially answering this question, let us mention a construction of Markov kernels 
satisfying (HHD. 

Remark 17 (a) Any irreducible Markov kernel P on {0,1} satisfies (|8]h Indeed, let ju be the 
associated invariant measure and by symmetry, assume that ^(0) ^ /r( 1). Then there exists 
a e [— fi(0)/, 1] such that P = al + (1 — a)/r, where /i is seen as the Markov kernel whose two 
rows are equal to /u. Any Markov kernel K := bl + (1 — 6)/r, with b £ [—/r(0)/^(l), 1], belongs to 
K,{P). Taking b = — /u(0)//i(l) (respectively b = 1), the first row of K is (0,1) (resp. (1,0)). This 
shows that 0 £ 'H(P). 

(b) If Pi and P 2 are two Markov kernels on V\ and V 2 , then Pi (x) P 2 is a Markov kernel on 
V\ x V 2 . It appears that /C(Pi)(g)/C(P 2 ) cz /C(Pi<S)P 2 ) and in particular H(Pi) x H{P 2 ) c: H(Pi<g)P 2 )- 

(c) From the two points above, it follows that if P is an irreducible Markov kernel on {0,1}, 
then for any N £ N, P := P® N satisfies (1191) . Such Markov kernels were used in m to recover the 
hypergroup property of the biased Ehrenfest model (initially due to Eagleson [9]). 

□ 

We introduce now three assumptions which are helpful in the direction of deducing |8]) from (1 19 [) . 
First, the surjectivity of A as an operator on V(V): 

V(V) A = V(V) (20) 

Second, the determinism of A on 'H(P): 

V x 0 e 'H(P), A(x 0 ,-) = ^(^0) ( 21 ) 

where 7r(xo) is an element of V. Denote the image by n of 'H(P). The last hypothesis is 

an extension of (1161) to the identity kernel: 

AA*A = A (22) 


13 


Note that by multiplication on the left or on the right by A*, this implies that AA* and A*A are 
projection operators in their respective spaces L 2 (/i) and L 2 (/a). 

Proposition 18 Assume P is uniplicit and (fT6l) . (fl9l) . (1201) . Oil) and (f22ll hold. Then dH) is 
satisfied with P given by (fT7|) and more precisely 7r(P(P)) cz P{P). 

Before proving this statement, let us give another important consequence of uniplicity. If P is a 
Markov kernel on V, let A(P) be the algebra generated by P, namely the set of finite combinations 
of the form a$I + a\P + a 2 -P 2 + • • • + a n P n , where n e Z + and ao, ai, « 2 , ■ a n e R. Denote also by 
IC(V) the convex set of Markov kernels on V . 

Lemma 19 Assume that P is uniplicit. Then we have 

JC(P) = A(P) n K{V) 

In particular if (l22j) holds and P is uniplicit and satisfies ca, then the latter property can be 
extended to /C(P): 


M K e K.(P), AA*KA = KA 


(23) 


Proof 

Let (ipi, <f2, T\v\) b 0 an orthonormal basis of eigenvectors of P and let Ai, A2, Aiy| be the 

corresponding eigenvalues. Consider K e JC(P), by commutativity, (</q, ip2, T\v\) is a l so a basis 
of eigenvectors of K , denote by 6 1, 62, ..., 6\y\ the associated eigenvalues. Since the Ai, A2, ..., Ajy| 
are all distinct, we can find a polynomial R of degree at most \V\ such that 

VZe l\V\j, R(A t ) = d t 


It follows that K = R(P), showing that /C(P) c: A{P) n IC(V). The reverse inclusion is obviously 
always true. 

The second assertion of the lemma comes from the fact that (1161) implies that 


VneN, AA*P n A = P n A 


Indeed, this is shown by iteration on n £ N: 

AA*P n+1 A 


AA* P n (PA) 
AA*P n (AA*PA ) 
(AA*P n A)A*PA 
(P n A)A* PA 
P n (AA*PA) 
P n+1 A 


The case n = 0 corresponds to assumption (1221) . So we get that for any A e A(P), 

AA*AA = AA 


from which we deduce (1231) if P is uniplicit. 


Remark 20 (a) The inclusion A(P) n KfiV) cz IC(P) is always true, but it is not necessarily 
an equality. Indeed, if /C(P) = A(P) n JC(V), then the elements of JC(P) commute. But Sp 
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is naturally included into JC(P) via the representation Sp 3 g >—> T g e /C(V) where T g is the 
deterministic Markov kernel given by 

V X £ V, Tg(x, •) = &g(x) 

If the elements of /C(P) commute, then Sp is itself commutative. This is not always true, one 
can e.g. consider the transition kernel of the random walk generated by the transpositions on the 
permutation group 5 jv, with IV ^ 3. 

(b) The example of Remark 1171 is equally such that Sp is not commutative for N ^ 3. Indeed, 
consider for a e Sn the mapping g on {0,1}' V obtained by shuffling the coordinates according to 
a. Then T g , defined as above, belongs to Sp. It follows that Sp contains Sn as a subgroup and 
thus cannot be commutative. Despite the fact that P is not uniplicit, it was proven in m that 
the conclusion of Proposition [7] is true, where G := Spr. In this case P is a birth and death chain 
and is thus uniplicit. 

(c) Even if it outside the finite framework, the example of the Laplacian L on the sphere 
SS N cz M Ar+1 , with N ^ 1, is also such that IC(L) (rigorously, one should define it with respect 
to the associated heat kernel at a positive time) is not commutative, because Sp contains all the 
isometric transformations of SS N , namely the orthogonal group O (N + 1). Note nevertheless that 
since JC(L) is big, the same is true for 'H(L): it is the whole sphere! We mention this case, because 
it plays an important role in Carlen, Geronimo and Loss |3). At first view, it has some similarities 
with the situation of (b) above: L is not uniplicit but formally the conclusion of Proposition [7] is 
true when G is the subset of 0(N + 1) conserving the norm of the n first coordinates of M. N , with 
n £ IN- 1], 

(d) Despite what we just said, it seems there is an important difference between the cases 
(b) and (c) above. In the latter it can be checked that /C(L,A) =)= /C(L), while in the former we 
think that /C(P,A) = /C(P). That is why Proposition fl8l could be applied to such P without the 
assumption of uniplicity, thus explaining the validity of Proposition [7] for this example. In [14j . it 
was rather used that 'H(P,A) =)= 0, where 7i(P,A) := {x £ (0,1} W : 5 X JC(P, A) = V({0. 1} jV )}- 

□ 


With these observations, we can come to the 

Proof of Proposition 1181 

Consider xq £ 7f(P). Taking into account (l23l) . we have 

5 X0 AA*K{P)A = 5 X0 K{P) A 

= v(y) a 

= V{V) 

where we used (1201) . Assumption (1211) ensures that <5a; 0 A = cL-^), so we get 

6 Hxo) A*)C(P)A = V{V) 

Finally we use Lemma [16] to see that 

V{V) cz 6 <X0) 1C(P) 

which is the wanted result. 

■ 

It is time now to consider the purely determinist case for A, which simplifies most of the previous 
hypotheses. More precisely, assume that there exists a surjective mapping ir from V to V such 
that A is given by 

V xeV, A(x, ■) := S n(x) (•) (24) 
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Lemma 21 Under (1241) . if P is a Markov kernel on V such that (P, A) is positive and if P is 
a Markov kernel on V satisfying the intertwining relation (1141) (called Dynkin’s condition in this 
situation, see w , then (USD, (USD. (ED and ED are true. Furthermore, A* A = I and P is given 

by (ED. 

Proof 

Under Assumption (1241) . it was seen in Lemma 5 of [Hj that AA* is the conditional expectation 
with respect to the sigma-algebra T generated by 7r. 

Consider (flfil) . which amounts to 

V/e L 2 ( m ), AA*PA[f] = PA[f] 

Note that the relation PA[/] = AP[/] = P\f] o 7 r implies that PA[/] is T-measurable for any 
/ 6 L 2 (/r), so the above equality holds. Similarly, using that A[/] is T-measurable for any / e L 2 (/r), 
we get (1221) . It follows that A* A is a projection in L 2 (^r) and to see that A* A = I, it is sufficient 
to check that A*A is injective. So let / e L 2 (^t) be such that A*A[/] = 0, we get that 

/ ott = A[/] = AA*A[/] = 0 

Since 7r is surjective, it appears that / = 0. 

It follows that P is given by (fT71) : 


P = A*AP = A*PA 

Condition (f24l) implies obviously (12T1) . and (1201) due to the surjectivity of it. 


Remark 22 (a) The deterministic case (1241) is not the only one where (1161) is satisfied. Indeed, 
assume that 7r is surjective but not injective in (1161) . Let P be a Markov kernel on V such that 
(P, A) is positive. From LemmeEJ it appears that AA* = I, so we get 

AA*P*A* = P*A* 

namely (1161) for P* and A*. But since 7r is not injective, the conditional expectation A*A is not 
the identity, thus A* does not satisfy (ED- 

(b) Under Assumption (ED) Condition (fT5l) is also simple to understand: it asks that the 
conditional expectations with respect to T (the sigma-algebra generated by 7 r) with respect to fi 
and mo coincide. Namely, if (A\,...,A [) is the partition of V generating T (corresponding to the 
equivalence relation between x,y e V given by 7 r(x) = 7 r(y)), then m 0 satisfies (fT5l) if and only if 
it is of the form 


V x e V, mo(x) 


E 


fi(Ak) 


t Ak (x)n(x) 


where ( 01 ,..., o/) is a probability measure on [ZJ. 

□ 

From Proposition [IS] and Lemma El w e deduce: 

Corollary 23 Assume that the Markov kernel P is uniplicit and that 'H(P) =(= 0. Let P be a 
Markov kernel satisfying Relation ED with a link A given by (1241) with it surjective. Then P is 
uniplicit and satisfies (HD as well as the hypergroup property. 
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Proof 


The above results show that 'H(P) =|= 0. According to Lemma (2} it is then sufficient to check that 
P is uniplicit. By duality, we have P* A* = A *P* = A*P , it implies, via the equality A*A = I of 
Lemma l2ll 


P * = P*A*A 
= A* PA 
= P 

where we used (|16l) . which is true due to Lemma ED again. This shows that P is reversible. 
Consider 6 an eigenvector of P and ip, <p two associated eigenvectors. From the intertwining relation 
m we get 


OA = P[AM] 

and similarly for <p. By uniplicity of P, A[</?] and A[(5] are then co-linear. Remembering that A is 
injective by surjectivity of 7 r, we get that ip and <p are co-linear as wanted. 

■ 

Proposition [7] is itself a consequence of the previous corollary. Indeed, it is immediate to check 
that P. P and it given before Proposition [7] satisfy the intertwining relation (|14l) where A is defined 
by m- 

To end this section, we mention some (upper) semi-continuity properties associated to the 
Markov commutator convex semi-groups, suggesting the easy handling of this notion. Note that 
for any Markov kernel P on the finite set V and x e V, the sets /C(P) and JC(P,x) are compact 
subsets, respectively of the set of Markov kernels and of probability measures on V (endowed with 
the topologies inherited from those of “ and 10), themselves being compact. As usual, consider 
the Hausdorff topology on the compact subsets of a compact set, it turns it into a compact set 
itself. The following properties are elementary and their proofs are left to the reader. 

Lemma 24 Let {P n )ne N be a sequence of Markov kernels on V converging to P. We have for any 
xeV, 


lira sup IC(P n ) cr K.(P) 
n —>00 

lim sup K,{P n ,x) c: JC(P,x) 
n —>00 

lim sup 'H(Pn) cr T~L{P) 

n —>00 

As a consequence, the set of Markov kernels P on V satisfying the generalized hypergroup 
property dHJ) is closed. 

Let us remark that the above last inclusion can be strict. Anticipating a little on the next 
section, consider V := {0,1} and let (U n ) ne ^ be a sequence of functions on V satisfying U n ( 0) > 
U n ( 1) for all n £ N and lim n ^.oo U n = 0. With the notation of (f4|) . we have 

VneN, H{M Un ) = {0} 

lim M Un = M 0 

n —>00 

n{M 0 ) = {0,1} 
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3 On the discrete Achour-Trimeche’s theorem 


Here the specific birth and death situation is considered in a more detailed way. The diffusive 
Achour-Trimeche’s theorem will be partially translated into the discrete case, but first we show it 
cannot be extended to all convex potentials. It corresponds respectively to the proofs of Proposition 
[6] and Theorem [I] 

The previous section provided all the ingredients necessary to the 

Proof of Theorem |Tj 

Recall the setting described in the introduction. Theorem|T]is proven by a contradictory argument: 
assume there exists a generalized Metropolis procedure C 3 U >—* Py such that Py satisfies the 
hypergroup property for all U £ C. 

Since N ^ 2, there exists U £ C such that U( 0) = 17(1) and which is not symmetric with respect 
to the mapping [0, IV] 3 x > N — x. For e > 0, consider the function U e defined on [0, IV] by 


V x £ [0, IV], U e (x) 


U (0) T e , if x = 0 
U(x) , otherwise 


It is clear that U e £ C. Furthermore, due to the convexity of U and the assumption 17(0) = U(N), 
it appears that U e ( 0) > U e (x ) for all x £ [IV]. By Definition [3]), the minimum of gy e is only 
attained at 0. Taking into account Proposition 01 it follows that 'H(Py e ) = {0}. By letting e > 0 
go to zero, Lemma [Ml implies that 0 £ 'H(Pu). The same reasoning, where the value of U(N) is 
a little increased, equally enables to conclude that N e 'H(Pjj). So we get that {0, TV} cr 'H(Py). 
Since Pu is a birth and death, it is uniplicit, and according to Proposition 01 we can find g e Sp v 
with g(0) = N. Note that under the action of any element of the symmetry group Sp, the graph of 
the transitions permitted by P is preserved (not taking into account the self-loops). For birth and 
death transitions on [0, IV], this graph is the usual linear graph structure of [0, IV]. There are only 
two graph morphisms preserving this structure, the identity and the mapping [0, IV] 3 x >—> N — x. 
So we end up with a contradiction, because g can be neither of them. 


We now come to the proof of Proposition 01 We begin by reducing the problem to symmetric 
potentials. Recall that the classical Metropolis procedure C 3 U >—» My is defined by Q. 

Lemma 25 If for all N e N\{1} ; the Metropolis kernel Mjj satisfies the hypergroup property for 
U e C s , then it is also true for U £ C m . 

Proof 

This is a consequence of Proposition [71 Indeed, let U e C m , up to reversing the discrete segment 
[0, IV], assume that U is non-increasing. Consider V ■= [0, 2IV + 1], on which we construct the 
potential U by symmetrization of U with respect to IV + 1/2. Note that U is convex and more 
precisely that U £ C m , due to the assumption U(N — 1) — U(N) ^ 21n(2), which implies 

U{N + 2) - U(N + 1) ^ 2 ln(2) 

= U(N + 1) -U(N) + 2 ln(2) 

^ U(N) — U(N — 1) + 41n(2) 

Associate to U the classical Metropolis kernel My on V. Let G = Sm 0 be the group consisting of 
the identity and of the involution [0, 2 N + 1] £ x >—> 2 N + 1 — x. The reduction presented before 
Proposition [7] transforms My into My (up to a modification of the constant Tijj given in [6]), 
which has no impact on the hypergroup property, since it amounts to change Mjj into a convex 
combination of My and I). Again, since My is a birth and death chain, it is uniplicit. Thus 
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Proposition [7] enables to see that Mjj satisfies Condition dSD, because by assumption this is true 
for Mjj. Applying once more Lemma [2] shows that Mjj satisfies the hypergroup property. 


Remark 26 In the above proof, another symmetrization could have been considered: let V := 
|0,2IV] and U be obtained from U by symmetry with respect to N (U being non-increasing). 
Applying the same arguments under the relaxed assumption U(N — 1) — U(N ) ^ ln(2) (implying 
U(N + 1) — U(N) ^ U(N) — U(N— 1) + 2 ln(2)) , we get in the end that Mu satisfies the hypergroup 
property, where Mu is defined as Mu in Q, but with Mq replaced by the exploration kernel Ado 
given by 

^ f 1/2 , if \x — y\ = 1 and x =)= N 

V x =|= y e [0, IV], M 0 (x, y ) := <1 , if x = N and y = N — 1 

[ 0 , otherwise 

□ 


It remains to prove that for U e C s , Mu satisfies the hypergroup property. We did not find 
general arguments to obtain this result. Instead, we will adapt to the discrete case the proof 
presented by Bakry and Huet [3j in the context of symmetric one-dimensional diffusions. 

Proposition 27 For any U e C s , the Metropolis kernel Mu satisfies the hypergroup property, with 
respect to the points 0 and N. 

By uniplicity of Mu and its symmetry with respect to the mapping 


s : |0, N} 3 x h> N — x (25) 

it is sufficient to check that 0 6 FL(Mu) for given U e C s . Let us consider more generally the problem 
of showing that 0 e T-L(P), when P is an irreducible birth and death Markov transition on [0, NJ, 
left invariant by the symmetry s. By definition, it amounts to show that for any given probability 
mo £ P([0, TV]]), there is a Markov kernel K commutating with P and such that AT(0, •) = mo- 
This question is equivalent to the fact that a wave equation starting from a non-negative condition 
remains non-negative, as it was shown by Bakry and Huet [3] in the diffusive situation and in 
Remark 6 of nu for the discrete case. More precisely, there is a unique matrix K commuting with 
P such that K (0, •) = mo (due to the uniplicity of Mu , see the proof of Lemma fl3l or Lemma 10 of 
El), our problem is to check that its entries are non-negative. Denote L = P — I, the Markovian 
generator matrix associated to P and 


V x,ye [0,.ZV], k{x,y) 


K(x,y) 

h(v) 


The commutation of K with P can be rewritten as the wave equation 

V x,y e [0, NJ, L (1) [/c](x,y) = L (2) [k](x, y) (26) 

where for i e {1, 2}, L® stands for the generator acting on the 7-th variable as L. 

Consider the discrete triangle 


A := {(x, y) e [0, IV] 2 : x y and x ^ N — y} 


For zq := (xq,7/o) e A, let p ZQ := {p zo {n)) ne ^o, 2 y 0 ] be the path defined by iteration through 


Pzo (°) 

V 77 e [0,2yo - 1], p-(n + l) 


p zo (n ) — (0,1) , if ?7 is even 

p~ (n) — (1, 0) , if ?7 is odd 
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Note that the path p~ Q stays in A and that p~ o (2y 0 ) belongs to the segment [0, N} x {0}. 
Similarly, for zq £ A, we define the path p+ := (pt 0 (n))ne[o,2y 0 J’ which is symmetric to pf Q with 
respect to the axe x = xq. The interest of these paths is: 

Lemma 28 Assume that the mapping k : [0,N] 2 —> R satisfies the wave equation (1261) . Then for 
any z 0 := ( x 0 ,y 0 ) e A, we have, if y 0 ^ 1, 

u(zo,Pz 0 (l))k(z 0 ) = [u)(z 0 ,p~ o ( 1)) -w(p“(l),p“(2)) -w(p+(l),p+(2))]A:(p“(l)) 

+^(p7 0 (2y 0 - l),pf o (2y 0 ))k(pf o {2y 0 )) + w(p+(2y 0 - T),pt 0 (fiyo))k(pf 0 (2y 0 )) 

+ 2 [“(Pzot™ ~ 1 )>^r 0 ( n )) - 

nep,22/0-1] 

+ 2 [“(ptoin-V’Ptin))-u(pt 0 (.n),pt 0 (n + l))]k(pt 0 (n)) 

nep,22/ 0 -i] 

where for any (z,z') := ((x,y), ( x',y')) e [0, IV], we take 

, f p(x)p{y)L(x,x') , if z’- ze {(1,0), (-1,0)} 

J • 1 p(x)p(y)L(y,y') , if z f - z e {( 0,1),(0,-1)} 


Proof 

From the reversibility of L with respect to p, we deduce the discrete integration by part formula: 
for any functions f,g on JO, IV], we have 

p[f L [g]\ = - 2 g( x ) L ( x iy)if(.y)-f( x )][9(y)-g(x)] 

0 ^x<y^N 

In particular, if / is the indicator function of a segment [<?, r] c [0, IV], we get 

/4 1 lq,r} L [g]] = [g(r + 1) - g(r)]p{r)L(r,r + 1) + [g(q - 1) - g(q)]p{r)L(q,q - 1) (27) 

with the convention (Neumann boundary) that g{— 1) = g(0) and g(N + 1) = g(N). 

For zq £ A, define the discrete triangle 

A(z 0 ) := {(x, y) £ [0, IV] 2 : x ^ y - y 0 + x 0 - 1 and x < -y + y 0 + x 0 - 1} (28) 

Applying ([27)1 horizontally and vertically, we get, for k satisfying the wave equation ([26]) . 

o - j.® 2 [ 1 a(„)U ( 1 | -£ ( 2 , )M] 

= Xj d H e )x(e)u(e) (29) 

eedA(zo) 

where the boundary BA(zo) of A ( 20 ) is defined by 

dA(z 0 ) := {(z,z’)e A(z 0 ) x ([0, IV] 2 \A(^)) : z' - z £ {(1,0), (-1, 0), (0,1), (0,-1)}} 
and where for any e := (. z,z') £ dA(zo), u/(e) was defined in the statement of the lemma and 

dk(e) := k(z ') — k(z) 

, - I 1 , if ^-^£ {(1,0), (-1,0)} 

X( , ) ' 1-1 , if z' — z e {(0,1), (0, —1)} 

It is easy (but a picture can help) that (1291) can written under the form 

0 = [k(p~(n + 1)) - k(p-(n))]u(p-(n),p~(n + 1)) 

ns [ 0 , 22 / 0 - 1 ] 

+ 2 [ k (Pz 0 ( n + 1 )) ~ k (Pz 0 ( n )')] UJ (Pzo( n )’Pzo( n + 1 )) ( 30 ) 

ne[l, 22/0-I] 
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Observe that the first sum can be transformed (via discrete integration by parts, also known as 
Abel’s trick) into 

2 i k (Pz 0 ( n + !)) - KP^n))}^{p~ 0 {n),p- Q {n + 1 )) 

ne[ 0 ,22/0-1] 

= fc(P^( 2 2 /o))w(p“ (2yo - l),p7 o (2y 0 )) - k(zo)u(z 0 ,pf 0 (l)) 

2 k(p-(n))[u(p-(n),p-(n + 1)) - u(p~(n - l),p" (n))] 

ne[l, 22/0-1] 

A similar manipulation is possible for the second sum (1301) and we end up with the result announced 
in the lemma. 


As a consequence, we get 

Proposition 29 Assume that P is a birth and death transition kernel on [0, NJ such that 

V z := (x, y) e A, P(y — l,y) ^ P{x, x — 1) + P(x, x + 1) 

V z := (x, y) e A, P(y, y — 1) ^ P(x — 1, x) a P(x + 1, x) 

where A is the “interior” of A: 

A := {(x, y) e [0, AJ 2 : x ^ y — 1 and x ^ N — y — 1} 

Let k be a solution of (1261) such that fc(-,0) is non-negative. Then k remains non-negative on A. 


Proof 

We begin by showing that the condition of the proposition (which can be written identically in 
terms of L), implies that for any zq := (xq, yo) e A and n e |2, 2yo — 1], we have 


u(p Z0 (n-l),p Z0 (n))-uj(p Z0 (n),p Z0 (n + l)) ^ 0 

u(pt( n “ 1 )^Pt 0 ( n )) ~ u{pt 0 ( n )iPt 0 ( n + !)) > 0 

It amounts to see that for any (x,y) e A, 

| v((x,y),(x,y ~ 1)) -uj({x,y - 1), (x - l,y - 1)) Ss 0 

1 u((x,y),(x,y - 1)) - u}((x,y - 1), (x + l,y - 1)) ^ 0 

and that for any (x,y) e A, 

| w((x,y),(x - l,y)) -u((x - l,y),(x - l,y - 1)) ^ 0 

\ u((x,y),(x + l,y))-u((x + l,y),(x + l,y-l)) > 0 

Concerning (13TT) . let £ e {—1, +1}, we have 

v((x,y), (x,y - 1)) -u((x,y- 1), (x + s,y- 1)) 

= p(x)p(y)L(y, y — 1) — p{x)p(y - 1 )L(x, x + e) 
= P(x)p(y - l)[L(y - l,y) - L{x,x + e)] 


(31) 


(32) 


where we used the reversibility of p with respect to L. By the first assumed inequality, we have in 
particular P(y — 1, y) ^ P(x, x — 1) v P(x, x + 1), so that the last r.h.s. is non negative, as wanted. 
The treatment of (|32p is similar, taking into account the second assumed inequality: 


w((x, y), (x + £, y )) - u((x + s,y),(x + e,y - 1)) 

= h{x)p(y)L(x, x + e) - p(x + e)p(y)L(y, y - 1) 
= p(x + e)p(y)[L(x + e,x) - L(y,y - 1)] 

^ 0 
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Next we want to show that 


w (^o,^ 0 (l)) - w(p Zo (l),p ao (2))-w(p+(l),p+(2)) Ss 0 
Writing (x,y) ■- p~ ( 1), it means that 

w((x,y + l),(x,y))-w((x,y),(x - l,y))-u;((x,y),(x + l,y)) > 0 

namely 


^(x)p{y)[L(y, y + 1) - L(x, x + 1) - L(x, x - 1)] ^ 0 

condition which is satisfied by the first assumed inequality of the lemma (since zq = (x,y + 1 )). 
Thus all the coefficients in front of values of k in the equality of Lemma [28] are non-negative. 
Assume that k does not remain non-negative on A. We can then consider j/o the minimal value of 
y e [0,1V]] such that there exists y < x < N — x such that k(x,y ) < 0. Next, let xo the minimal 
value of x e [y, N — y\ such that k(x, yo) < 0. In particular, zq := (xo, yo) £ A and k(zo) < 0, fact 
which is in contradiction with the equality of Lemma 1281 whose r.h.s. is non-negative. 

■ 

Assume now that P is furthermore left invariant by the symmetry s defined in (1251) . One 
important consequence is that the conclusion of Proposition [29] is valid on the whole discrete 
square [0,1V] 2 : 

Proposition 30 Assume that the birth and death transition P on [0,1V] is invariant by s. Let k 
be a solution of (1261) . Then k is left invariant by the following symmetries of the discrete square: 

[0,1V] 2 a (x,y) ^ (y, x) 

[0, IV] 2 3 (x, y) - (IV — x, IV — y) 

[0, IV] 2 3 (x, y) - (IV — y,N — x) 

As a consequence, if k is non-negative on A, then it is non-negative on [0,1V] 2 . 

Proof 

Consider k : A —> M satisfying the wave equation (1261) on A. Extend k to the discrete triangle 
A 2 := {(x,y) £ [0,1V] 2 : y < N — x} by symmetry with respect to the line y = x. Let us check 

that k satisfies (126|) on A 2 := {(x, y) e [0, IV] 2 : y < N — x — 1}. By symmetry of P, it is obvious 

on the image of A by the mapping (x,y) >—> (y, x). Thus it is sufficient to show that (1261) is also 
valid on the points (x,x) e A 2 . Indeed, we compute that 

[fc] (x, x) — [&] (x, x) 

= L(x, x + 1 )(k(x + 1, x) — k(x, x)) + L(x, x — 1 )(k(x — 1, x) — k(x, x)) 

—L(x,x + 1 )(k(x,x + 1) — k(x,x)) — L(x,x — l)(A:(x,x) — k(x,x — 1)) 

= 0 

due to the construction by symmetrization. 

Next we can extend k to [0, IV] 2 by symmetrization with respect to the line y = N — x. The same 
arguments as above show that this extension satisfies (|26l) on [0,1V] 2 . Observe that the mapping 
k constructed in this way is left invariant by the symmetries presented in the lemma. 

Now consider k : [0,1V] 2 -*la solution of ([26D . Let k be its restriction to A. By the above 
construction, we extend k to [0, IV] 2 into a function also satisfying (1261) . Note that k(-,0) = k(-, 0), 
so by uniqueness of the solution of (1261) given its value on the discrete segment {0} x [0,1V], we 
get k = k. 
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Consider the following assumption called (H): the mappings [0, [IV/2J] 3 x <—* 2 x P(x,x + 1) 
and [0, [IV/2J] 3 x >—> P{x + l,x) are respectively non-increasing and non-decreasing. 

Our main result about a partial extension of Achour-Trimeche’s theorem to the discrete setting 
can be stated as 

Theorem 31 Assume that the birth and death transition P on [0, NJ is invariant by s and that 
(H) is fulfilled. Then P satisfies the hypergoup property with respect to 0 and N. 


Proof 

According to Proposition [301 it is enough to check that (H) implies the assumption of Proposi¬ 
tion [29l Note that in the case where N is odd, by symmetry of P through s, we have P((N — 
l)/2, (N + l)/2) = P((N + l)/2, (N — l)/2). When N is even, we rather get P(N/2, N/2 + 1) = 
P(N/ 2, N/2 — 1) and P(N/2 — 1, N/2) = P(N/2 + 1, N/2). In both situations, it appears that (H) 
leads to 


V y e [0, [N/2\ - 1], V x e \y + 1, [N/ 2j], 

By symmetry of P through s, it follows that 
Vl/elO, [N/2\ - 1], V x e \y + 1,1V - y - 1], 
which is the assumption of Proposition [29l 


2 P(x + 1, x) ^ 2P(x, x + 1) 5% P(y , y + 1) 
P{y + 1, y) ^ P(x + 1, x) P(x, x + 1) 


P(x + 1, x) + P(x, x + 1) ^ P(y , y + 1) 
P(y + 1, y) < P(x + 1, x) A P(x, x + 1) 


As a simple corollary we obtain Proposition [27l because Py satisfies (H) if U e C s . Indeed, 
this condition asks for the mappings [0, [A"/2j] 3 x ^ U(x) — U(x + 1) + 21n(2)x and [0, [iV/2j] 3 
x i —> U(x + 1) — U(x) to be respectively non-increasing and non-decreasing. This is valid, by the 
definition of C given before Proposition [6l 

Remarks 32 

(a) One can replace the exploration kernel Mq given in Q by Mq defined via 

_ C 1/2 , if \x — y\ = 1, x =|= 0 and x N 

V x 4 = y 6 [0,7V], M 0 (x,y) := l 1 , if (x, y) = (0,1) or (x, y) = (N,N — 1) 

[ 0 , otherwise 

The corresponding Metropolis procedure C s 3 U > My (where My is defined as in (J4|), with Mo 
replaced by Mo) also satisfies the hypergroup property, because (H) is equally true for these birth 
and death Markovian transitions. 

Taking into account Remark [26l this result can be extended to the Metropolis procedure C m u C s 3 

U ^ My. 

Nevertheless, due to the fact that 0 f C, we are not able to recover that Mo satisfies the 
hypergroup property, as it was shown in Example 7 of [14| . 

(b) For U e C, consider the variant classical Metropolis procedure My given by 

v X 4= y £ [0, NJ, My(x, y) := M 0 (x, y) exp(-(I/(y) - U(x))+) (33) 

Simulations suggest that My satisfies the hypergroup property if the convex function U is either 
monotonous or symmetric with respect to the middle point of the discrete segment [0, NJ. It would 
be a nice discrete extension of the Achour-Trimeche’s theorem, but we have not been able to prove 
this conjecture. 
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(c) The previous conjecture is not true if in (1351) . M 0 is replaced by M 0 (given by (|5|)). Indeed, 
consider the case N = 2 and U = 0. Let k be the solution of the corresponding wave equation (1261) 
starting from k(-, 0) := (0,1,0). Equation (1261) at point (1,1) writes: 

i(/c(0,0) — /c(l,0)) + ^(fc(0,0) — fe(l,0)) = ±(fc(l,l)-fc(l,0)) 

namely fc(l,l) = —fc(l,0) = —1. So non-negativity is not preserved by (1261) and by consequence 
Mo does not satisfy the hypergroup property. 

In particular the assumption U e C is not merely technical in Proposition [6] Note this obser¬ 
vation is not in contradiction with the conjecture given in (b). 

(d) Theorem [31] enables to construct other examples of birth and death Metropolis procedures 
satisfying the hypergroup property. E.g. consider the exploration kernel Mq given by 


V®*ye[0,JV], Mq(x, y) 


l/2 XA ( N x ) , if \x — y\ = 1 

0 , otherwise 


Let C s be the set of potentials U symmetric with respect to N /2 and such that U e C, where 

V :r e [0, IV], U{x) := U(x) + ln(2)(x a (IV - as)) 

Define the Markov kernel Mjj via 

V x + y e [0,IV], M v (x,y) := M 0 (x,y)exp(-(U(y) - U(x))+) 

For U 6 C s , Mjj satisfies (H) and admits yjj, the Gibbs measure defined in ([3]), as reversible 
measure. Thus C s 3 U >—> Mjj is a generalized birth and death Metropolis procedure satisfying 
the hypergroup property. The proof of Lemma [25] enables to deduce a similar construction for 
monotonous potentials (for instance for convex potentials U such that |0,IV] 3 x >—> U{x) + ln(2)x 
is non-increasing). 

Note that the potentials from C s are more general than those from C s , since the former ones can 

grow linearly (away from the middle point of the state space), while the latter ones must grow 

quadratically. The drawback is that Mjj is further away from the continuous model d 2 — U'd than 
Mu defined in (j4|). 

□ 


To finish, let us mention a non-negativity preservation on edges rather than on vertices under 
a natural relaxation of the assumption of Proposition [29j 

Proposition 33 Assume that P is a birth and death transition kernel on [[0, IV] such that 

M z'.= {x 1 y)e A, P(y — l,y) ^ P(x, x — 1) v P(x, x + 1) 

V z := (x, y) e A, P(y,y — 1) ^ P(x — 1, x) a P(x + 1, x) 

Let k be a solution of (1261) such that k(-, 0) is non-negative. Then for any (x,y) e A, we have 

(x, y + 1) £ A => k(x, y) + k(x, y + 1)^0 

(x + 1, y) e A fi(x)k(x, y) + y(x + 1 )k{x + 1, y) ^ 0 


Proof 
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Note that the equality of Lemma ESI can be rewritten under the form: 


^(z 0 ,p Zo (l))[k{z 0 ) + Kp z 0 (i))] 

= (2y 0 - l),p~ 0 {2y 0 ))k{p-{2y 0 )) + u(p+(2y 0 - l),p+ o (2y 0 ))k(p+ o {2y 0 )) 

+ 2 M?Lo( n - - u(p- 0 (n),p~ 0 (n + 1 ))]fc(p“(n)) 

ne 11,22/0—11 
nell,22 /q-I] 


So the first implication of the above proposition can be shown as in the proof of Proposition 1291 
which enables to see that k(zo) + k{p~ (1)) ^ 0, if yo ^ L 

For the second implication, rather consider for zq e A such that z$ + (1,0) e A, the path 
pt 0 : = (pt 0 ( n ))nelO, 22/0 + 1] defined by iteration through 


Pt 0 (°) 

V n e [0, 2yol, pt 0 (n + 1) 


Pt 0 ( n ) + (1,0) 

pt 0 H - (0,1) 


, if n is even 
, if n is odd 


The set A(zq) defined in il28ll must be modified into the “almost triangle” 


A(z 0 ) := {(x, y) e [0, IV] 2 : x y - y 0 + x 0 - 1 and x -y + y 0 + x 0 ) 


The proof of Lemma [28] then leads to 


u(zo,p ZQ {l))k(z 0 ) + w(p+(l),j>+(2))A:(p+(1)) 

= u(p~ o (2y 0 - l),p“ (2y 0 ))fc(p“ (2j/o)) + u(p+ o (2y 0 ),p^ o (2y 0 + l))k(p+ o (2y 0 + 1)) 

+ 2 [ w (P«( n_ !),P2 0 ( n )) _w (Pzo( n )’Ao( ra + 1 ))] fc (P^( Jl )) 

nep,22/0-1] 

+ 2 [ w (P*>( n “!),P^( n ))- w (P^ 0 ( n )’Pii^ n + i ))] fe (pi( n )) 

ns[2,22/ 0 ] 

The proof of Proposition 1291 now implies that uj(zo, p~ Q (l))k(zo) + cu(p+(l),p+(2))fc(p+(1)) ^ 0, 
namely y(x 0 )k(z 0 ) + p(x 0 + 1 )k(x 0 + l,y 0 ) ^ 0. 


The advantage of Proposition [33] over Proposition [29] is that it enables to recover by approxi¬ 
mation (with N going to infinity) the result of Bakry and Huet [3) concerning the preservation of 
non-negativity by the wave equation in the context of the diffusive Achour-Trimeche theorem. 
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